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Abstract 

A robust affine invariant version of Kantorovich’s theorem on Newton’s method, for finding 
a zero of a differentiable vector field defined on a complete Riemannian manifold, is presented in 
this paper. In the analysis presented, the classical Lipschitz condition is relaxed by using a gen¬ 
eral majorant function, which allow to establish existence and local uniqueness of the solution 
as well as unifying previously results pertaining Newton’s method. The most important in our 
analysis is the robustness, namely, is given a prescribed ball, around the point satisfying Kan¬ 
torovich’s assumptions, ensuring convergence of the method for any starting point in this ball. 
Moreover, bounds for Q-quadratic convergence of the method which depend on the majorant 
function is obtained. 

Keywords: Newton’s method, robust Kantorovich’s theorem, majorant function, vector field, 
Riemannian manifold 


1 Introduction 

Extension of concepts and techniques as well as methods of Mathematical Programming from the 
Euclidean space to Riemannian setting it is natural and has been done frequently before; see, 
e.g.,[ii m la na nzi [Ml Si!. The motivation of this extensions, which in general is nontrivial, is 
either of purely theoretical nature or aims at obtaining efficient algorithms; see, e.g., mmm 
ISIllMlllZlllHllMlIll] • Indeed, many optimization problems are naturally posed on Riemannian 
manifolds, which has a specific underlying geometric and algebraic structure that could be exploited 
to greatly reduce the cost of obtaining the solutions. For instance, in order to take advantage of 
the Riemannian geometric structure, it is suitable to treat some constrained optimization problems 
as one of finding the zeros of a gradient vector field on a Riemannian manifolds rather than use the 
method of Lagrange multipliers or projection idea for solving the problem; see UmilSTllMllll!. In 
this case, constrained optimization problems can be seen as unconstrained one from the Riemannian 
geometry viewpoint. Besides, the Riemannian geometry allows to induce new research directions 
so as to produce competitive algorithms; see HSIEIIIMIEH]. In this paper, instead of considering 
the problem of finding the zero of the gradient field on a Riemannian manifolds, let us consider the 
more general problem of finding a zeros of a vector field defined on a Riemannian manifold. 
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On the other hand, the Newton’s method and its variant are powerful tools for finding a zero 
of nonlinear function in real or complex Banach space. Besides its practical applications, Newton’s 
method is also a powerful theoretical tool having a wide range of applications in pure mathematics; 
see [8l[l9l[22l[29l[30l[43]. Therefore, a couple of papers have dealt with the issue of generalization of 
Newton’s method and its variant from Euclidean to Riemannian setting in order to go further in the 
study of the convergence properties of this method. Early works dealing with the generalization of 
Newton’s methods to Riemannian setting include [TOKUllIHllMlIMlEe]. Actually, the generalization 
of Newton’s method to Riemannian setting has been done with several different purposes, including 
the purpose of finding a zeros of a gradient vector field or, more generally, with the purpose of finding 
a zero of a differentiable vector field; see BElElEllTlElliollISllIllIlllMlESlEelETlElllMlEa 
EHl SOI SD sa 06] and the references therein. 

Properties of convergence of Newton’s method have been extensively studied on several papers 
due to the important role that it plays in the development of numerical methods for finding a zero 
of a differentiable vector held dehned on a complete Riemannian manifolds. In 2002 Ferreira and 
Svaiter in m extended the Kantorovich’s theorem on the Newton’s method to Riemannian setting 
using a new technique which simplihes the analysis and proof of this theorem. It is worth mention 
that, in a similar spirit, an extensions of the famous Smale’s theory; see [37], to analytic vector 
helds on analytic Riemannian manifolds were done in 2003 by Dedieu et al. in |9]. The basic idea 
of 115] was to combine a formulation of Kantorovich’s theorem by means of quadratic majorant 
functions, see |l5] for more general majorant functions, with the dehnitions of good regions for 
the Newton’s method. In these regions, the majorant function bounds the vector helds which the 
zero is to be found, and the behavior of the Newton’s iteration in these regions is estimated using 
iterations associated to the majorant function. Moreover, as a whole, the union of all these regions 
is invariant under Newton’s iteration. Afterward, this technique was successfully employed for 
proving generalized versions of Kantorovich’s theorem in Riemannian setting. Inspired by previous 
work of Zabrejko and Nguen in [45] on Kantorovich’s majorant method, a radial parametrization 
of a Lipschitz-type and L-average Lipschitz affine invariant majorante conditions were introduced 
in Riemannian setting by Alvarez et al. in [3] and Li and Wang in [25], respectively, in order to 
establish existence and local uniqueness of the solution as well as unifying previously convergence 
criterion of Newton’s method. 

In the present paper, we will use the technique introduced in [15], see also HZ], to present a 
robust affine invariant version of the Kantorovich’s theorem on the Newton’s method finding a zeros 
of a differentiable vector field defined on a complete Riemannian manifold. In our analysis, the 
classical Lipschitz condition is relaxed using a general majorant function. The analysis presented 
provides a clear relationship between the majorant function and the vector field under consideration. 
However, the most important in our analysis is the robustness, namely, we give a prescribed ball, 
around the point satisfying the Kantorovich’s assumptions, ensuring convergence of the method 
for any starting point in this ball. Moreover, we establish bounds for Q-quadratic convergence of 
the method which depend on the majorant function. Also, as in [3] and [25], this analysis allows 
us establish existence and local uniqueness of the solution as well as unifying previously results 
pertaining Newton’s method. 

The organization of the paper is as follows. In Section [LT] some notations and one basic results 
used in the paper are presented. In Section [21 the main result is stated, namely, the robust affine 
invariant Kantorovich’s theorem for Newton’s method and in Section [2] the affine invariant version, 
which is used for proving the robust one is stated and proved. In Section [4] we prove the main 
theorem. In Section [5] three special case of the main theorem is presented. Some final remarks are 
made in Section [6l 
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1.1 Notation and anxiliary results 

In this section we recall some notations, definitions and basic properties of Riemannian manifolds 
used throughout the paper, they can be found, for example, in m and [23] . 

Throughout the paper, A1 is a smooth manifold and is the class of all continuously 

differentiable functions on Ai. The space of vector fields C^{M.) on M. is denoted by by 

TpM. we denote the tangent space of A1 at p and by TAi = UxeAl tangent bundle of 

Ai. Let Ai be endowed with a Riemannian metric (•, •), with corresponding norm denoted by || • ||, 
so that Ai is now a Riemannian manifold. Let us recall that the metric can be used to define the 
length of a piecewise curve C : [a,b] ^ Ai by 

i[C,a,b]:= f IIC'WML 

J a 

Minimizing this length functional over the set of all such curves we obtain a distance d{p, q), which 
induces the original topology on Ai. The open and closed balls of radius r > 0 centered at p are 
defined, respectively, as 

B{p, r) := {q £ M : d{p, q) < r} , B\p, r] := {q G M : d{p, q) < r} . 

Let C be a curve joining the points p and qm. Ai and let V be the Levi-Civita connection associated 
to {Ai, (,)). For each t G [a,b], V induces an isometry, relative to (•, •), 

: 7c(a)-Ad ^ 

V '—^ P<:,a,tv = v{t), 

where V is the unique vector field on ( such that = 0 and V(a) = v, the so-called parallel 

transport along ( from ({a) to C{t). Note also that 

PCM,b2 ° Pc,a.,bi = P(,a,b2i Pc,,b,a = P^yO^b ■ ( 2 ) 

A vector field V along is said to be parallel if = 0. If f itself is parallel, then we say that 

C is a geodesic. The geodesic equation V = 0 is a second order nonlinear ordinary differential 
equation, so the geodesic C is determined by its position p and velocity v at p. The restriction 
of a geodesic to a closed bounded interval is called a geodesic segment. It is easy to check that 
IlC^II is constant. We usually do not distinguish between a geodesic and its geodesic segment, as no 
confusion can arise. We say that C is normalized if ||C^|| = 1. A geodesic Q •. [a,h] ^ Ai is said to 
be minimal if its length is equal the distance of its end points, i.e. i[C,a,b] = d{({a),C{b)). 

A Riemannian manifold is complete if its geodesics are defined for any values of t. In this paper, 
all manifolds Ai are assumed to be complete. The Hopf-Rinow’s theorem asserts that if this is the 
case then any pair of points, say p and q, in Ai can be joined by a (not necessarily unique) minimal 
geodesic segment. Moreover, {Ai,d) is a complete metric space and bounded and closed subsets 
are compact. The exponential map at p, exp^ : TpAi —)■ Ai, is defined by exp^u = Cd( 1)5 where Cv 
is the geodesic defined by its position p and velocity v at p and Cv{t) = exp^tu for any value of t. 

Let X G C^{Ai). The covariant derivative of X determined by the Levi-Civita connection V 
defines at each p £ Ai a linear map XX{p) : TpAi —)• TpAi given by 

VX{p)v.= VyX{p), (3) 

where T is a vector field such that Y{p) = v. 
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Definition 1. Let Yi,... ,Yn be vector fields on Ai. Then, the n-th covariant derivative of X with 
respect to Yi,... ,Yn is defined inductively by 

:= Vy,Vy,X, V^{y,Y^X := VyJVy„_, • • • Vy,X). 

Definition 2. Let p £ Ai. Then, the n-th covariant derivative of X at p is the n-th multilinear 
map X"'X{p) : TpAi x ... x TpAi —)• TpAi defined by 

V^X{p){vi,...,Vn) ■.= Xly^^r.^X{p), 

where Yi,... ,Yn are vector fields on Ai such that Yi{p) = vi,..., Yn{p) = Vn- 

We remark that Definition [2] only depends on the n-tuple of vectors {vi,...,Vn) since the 
covariant derivative is tensorial in each vector field Y^. 

Definition 3. Let p ^ Ai. The norm of an n-th multilinear map A : TpAi x ... x TpAi —)• TpAi 
is defined by 


ll^ll = sup{||A(?;i,... ,z;„)|| : vi,... ,Vn € TpAi, \\vi\\ = 1, i = 1,... ,n} . 

In particular, the norm of the n-th covariant derivative of X at p is given by 

||V"X(p)|| = sup{||V"-X(p)(t!i,... ,?;n)|| : vi,...,Vn€ TpAi, llt'iH = 1, i = 1,... , n} . 

Now, the Fundamental Theorem of Calculus for a vector field X becomes 

Lemma 1. Let Q be an open subset of Ai, X a vector field defined on Ll and (^ : ^ D a 

curve. Then 

Pc,t,aX(C(t)) = X(C(a)) + f P^^s,a^Xids)) Cis) ds, t G [a,b]. 

J a 

Proof. See [15]. □ 

Lemma 2. Let Q be an open subset of Ai, X a C'^ vector field defined on Q and C : [a,6] ^ D a 
curve. Then for all Y G X{Ai) we have 

Pc,i,aVX(C(t))y(C(t)) = VX(C(a))F(C(a))+ f P^,s,aV^X{C{s)) {Y{C{s)), {s)) ds, t G [a,b]. 

J a 

Proof. See [23]. □ 

Lemma 3 (Banach’s Lemma). Let B be a linear operator and let Ip be the identity operator in 

TpAI. If \\B — IpW < 1 then B is nonsingular and ||B“^|| < 1/ (1 — \\B — Ip\\) . 

Proof. See, for example, |3Zj. □ 

We also need the following elementary convex analysis result, see [20] : 

Proposition 1. Let I gM be an interval and (p : I ^ M be convex. For any sq G int(I), the left 
derivative there exist (in's.) 

n- / N r (p(so) - <p(s) 

D dso) := hm - -= sup,<,(, -. 

0 So — s So — S 

Moreover, if s,t,r ^ I, s < r, and s ^t ^r then ip{t) — ip{s) ^ [pd) — y^(s)] [{t — s)/{r — s)]. 
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2 Robust Kantorovich’s Theorem on Newton’s Method 


Our goal is to state and prove a robust affine invariant version of Kantorovich’s Theorem on 
Newton’s Method for finding a zero of a vector field: 

X{p) = 0, (4) 

where At is a Riemannian manifold, C At an open set and X : —>• TAt a continuously 
differentiable vector field. The most important in our analysis is the robustness, namely, we give a 
prescribed ball, around the point satisfying the Kantorovich’s assumptions, ensuring convergence 
of the method for any starting point in this ball. Moreover, we establish bounds for Q-quadratic 
convergence of the method which depend on the majorant function. Also, as in [3] and [25], this 
analysis allows us establish existence and local uniqueness of the solution. For state the theorem 
we need some definitions. We beginning with the following dehnition which was introduced in |3|. 

Definition 4. Let R > 0, n G N\{0}, po G Ai and Gn{po,R) be the class of all piecewise geodesic 
curves ^ : [0, T] —A4 for some T > 0 which satisfy the following conditions: 

1. ^(0) = po CLiT'd the length of is no greater than R; 

2. there exist co,ci,...Cn G [0,T] with cq = 0 < ci < ■ ■ ■ < Cn = T such that 

.. .fi, , are n — 1 minimizing geodesics and T, , is a geodesic. 

Remark 1. Since A4 is complete, Gn{po-,R) is nonempty. Moreover, Qn{po,R) C Qn+i{po,R) for 
all n G N\{0}. Note that, in Definition^ Gi{Po-,R) is the class of all minimizing geodesic curves 
^ ; [0,T] —)• A4 with ^(0) = po and the length of is no greater than R. 

We also need the following definition which was equivalently stated in (3.7) of |3|, for G 2 {po, R)- 

Definition 5. Let Ll <G Ai an open set and R > 0 a scalar constanst. A continuously differentiable 
/ : [0, R) — )• M is said to be a majorant function at a point po G Ll for a continuously differentiable 
vector field X : Ll ^ TAi with respect to Gn{po,R) */VX(po) is nonsingular, B{pq,R) C Ll and 

II VX(po)-' [PcAO S7X{m)PLa,b - P5,a,oVX(C(a))] || < f {i[i, 0, h]) - f {i[i, 0, a]) , (5) 

for all ^ G Gn{po,R) with a,b G dom{ff) and 0 < a < b. Moreover, f satisfies the following 
conditions: 

hi. /(O) > 0 and f'{0) = —1; 

h2. /' is convex and strictly increasing; 

h3. f{t) = 0 for some t G (0, R). 

We also need of the following condition on the majorant condition / which will be considered 
to hold only when explicitly stated 

h4. f{t) < 0 for some t G {0,R). 

Remark 2. Since /(O) > 0 and f is continuous then condition h4 implies condition h3. 

The statement of our main result is: 
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Theorem 1. Let Ai be a Riemannian manifold, LI C Ai an open set and 0, its closure, X : 
O —>• TAi a continuous vector field and continuously differentiable on LI, R > 0 a scalar constant 
and / : [0, i?) —)■ M a continuously differentiable function. Take po G LI. Suppose that VX{pq) is 
nonsingular and f is a majorant function for X at po with respect to Qfipo,R) satisfying h4 and 
the inequality 

\\XX{po)-^X{po)\\<f{0). (6) 

Define F := sup{—/(i) : t G [0, R)}. Let 0 < p < r/2 and g : [0, i? — p) —)• M, 

9{t) ■= +p)+ ‘^p]- ( 7 ) 

Then g has a smallest zero t^.^p G {Q,R — p), the sequences generated by Newton’s Method for solving 
the equation X{p) = 0 and the equation g(t) = 0, with starting point q^, for any go £ B\pQ,p], and 
to = 0, respectively, 

qk+i = ex.pg^{-VX{qk)~^X{qk)) , 4 + 1 = 4 -^^, A: = 0,1,... . ( 8 ) 

are well defined, {g^} is contained in B{qQ,t^^p), { 4 } is strictly increasing, is contained in [ 0 ,t*,p) 
and converges to t^.^p. Moreover, {gfc} and {t^} satisfy the inequalities 


d{qk,qk+i) < 

ct 

inequalities 


d(gfc,gfc+i) < 4+1 - 4 , A; = 0 , 1 ,..., (9) 

d{qk-i,qkf < ^^^}l*’^] d{qk-i,qkf, A: = 1,2,... (10) 


(4-4_i)2 -2g'{U^p) 

and {qk} converges to p* G 4[go,4,p] such that X{pfi) = 0. Furthermore, {qk} and {tfc} satisfy the 


di^qkiP*) — A*,p tk, ^*,p 4+1 ^ 2 (^*:P Afc), k 0,1,... , (H) 

the convergence of {qk} and { 4 } top* and t*^p, respectively, are Q-quadratic as follow 


d{pk+i,P*) . D g'(4,p) D g'(4,p). .2 , ni 

hmsup „ ,, < ^ \ , 4,p-4+i < 3 : 777 —^(A*,p-Afc) , A; = 0,1,.... ( 12 ) 


fc—>00 P*)"^ 2 p^(t*^p) 

and p* is the unique singularity of X in B{pQ,f), where f >t* is defined as 


T := sup{t G [t*,i?) : /(t) < 0}. 


To prove the above theorem we need some previous results. First, in the next section, we prove 
a particular instance of this theorem, and then, in the Section [3.51 we prove Theorem [TJ 


3 Kantorovich’s Theorem on Newton’s Method 

In this section we will prove an affine invariant version of Kantorovich’s Theorem on Newton’s 
Method, it is a particular instance of Theorem [TJ namely, the case p = 0. We will use this 
theorem for proving Theorem [TJ The main results of this section are the bounds, depending on 
the majorant function, for the Q-quadratic convergence of the Newton’s Method, which gives an 
additional contribution for improving the results of Alvarez et al. in [3], Ferreira and Svaiter in |15j 
and Li and Wang in |25j . 


6 









Theorem 2. Let Ai be a Riemannian manifold, LI C Ai an open set and 0, its closure, X : 
O —^ TAi a continuous vector field and continuously differentiable on LI, R > 0 a scalar constant 
and / : [0, i?) —^ M a continuously differentiable function. Take po € ri. Suppose that VX(po) 
is nonsingular and f is a majorant function for X at po with respect to Q 2 {po-,R) satisfying the 
inequality 

\\XX{po)-^X{po)\\<f{0). (13) 

Then f has a smallest zero t* G (0, i?), the sequences generated by Newton’s Method for solving the 
equations X{p) = 0 and f{t) = 0, with starting point po and to = 0, respectively, 

Pfc+i = expp^ (-VX(pfc)"^X(pfc)) , tk+i = tk-jj^, k = 0,l,.... (14) 

are well defined, {pk} is contained in B{pQ,tfi), {tk} is strictly increasing, is contained in [0, i*) 
and converge to and satisfy the inequalities 

d{pk+i,Pk) < 4+1 - 4 , d{pk+i,Pk) < \ d{pk,Pk-if, ( 15 ) 

(4 - 4-i) 

for all k = 0,1,... , and k = 1,2,..., respectively. Moreover, {pk} converge to p* G B\pq, i*] such 
that X{pf) = 0, 

Pk') — t^ tk, 4 1/c+l ^ ^ ^k)-! ^ 0, 1? • • • (15) 

and, therefore, {4} converges Q-linearly to 4 and {pk} converge R-linearly to p*. If, additionally, 
f satisfies h4 then the following inequalities hold: 


d{pk+i,Pk) < ^^j}^^*}^ d{pk,Pk-i)^, 4+1-4 < 32^^(4-4-i)^, k = 1,2,... , (17) 

and, as a consequence, {pk} and {tk} converge Q-quadratically to p* and 4; respectively, as follow 


lim sup 

fc^OO 


d{p*,Pk+i) 

d{p*,Pk)‘^ 


^ D-f'iK) 
~ -2/'(4) ’ 




4+1 < 


D-f'iK) 

-2/'(4) 


(4 


4)^, k = 0,l,... , 


(18) 


and p* is the unique singularity of X in B{pq,t), where t >t^ is defined as 


T := sup{t G [4,-R) : f{t) < 0}. 


Henceforward we assume that all assumptions in above theorem hold. In this section, we will 
prove all the statements in Theorem [2] regarding to the majorant function and the real sequence 
{tk} associated. The main relationships between the majorant function and the vector field will be 
also established. 


3.1 The majorant function 

In this subsection we will study the majorant function / and prove all results regarding only the 
real sequence {4} defined by Newton’s method applied to the majorant function /. Dehne 

i := sup {t G [0, R) : f'{f) < O} . (19) 
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Proposition 2. The majorant function f has a smallest root t* G (0,-R), is strictly convex and 

/(t) > 0, f{t)<0, t <t - f{t)/f{t) <U, VtG[0,i*). (20) 

Moreover, f'{tif) ^ 0 and 

f'{U) < 0 3 t G f{t) < 0. (21) 

If, additionally, f satisfies condition h4 then the following statements hold: 
i) f{t) < 0 for any t G [Oj); 
a) 0 < t < R; 

Hi) 0 < r < i, where F := — f{t). 

iv) // 0 < p < r/2 then p < t/2 < i and f'{p) < 0. 


Proof. See Propositions 2.3 and 5.2 of m and Proposition 3 of US]. □ 

In view of the second inequality in (f20]l . Newton iteration is well defined in [0,t*). Let us call 

it Uf : [ 0 , t*) — ^ M, 

nf{t) :=t- f{t)/f'{t). (22) 

Proposition 3. Newton iteration nj maps [0,t*) into [0, t*) and there hold: 

t<nf{t), - re/(t) < ^(t* -1), VtG[0, t*). (23) 

If f also satisfies (h4), *.e., /'(t*) < 0, then 

t*-nf{t) VtG[0,r). (24) 

Proof. See Proposition 4 of m- □ 

The next two results follow from above proposition. 

Corollary 1. Take anyro G [0,t*) and define, inductively, = nf{Tk), k = 0,1,.... The sequence 


{rfc} is well defined, is strictly increasing, is contained in [0,t*) and converges Q-linearly to as 
follows 

T~k+1 — 2 k — 0 , 1 , . . . 

In particular, the definition (|14p of {t^} in Theorem [2] is equivalent to the following one 

to = 0, 4+l = ?^/(4), A: = 0,1,.... (25) 

and there holds 

Corollary 2. The sequence {tk} is well defined, is strictly increasing, is contained in [0,t*) and 
converges Q-linearly to t* as follows 

t* tk-i-i — 2 4)) k — 0 , 1 ,... 

If f also satisfies h4, then the following inequality holds 

4+1 - tfc < ~ ^k-if, k = l,2,... , (26) 

and, as a consequence, {tfc} converges Q-quadratically to t* as follow 

D-f'iU) 
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3.2 Relationship between the majorant function and the vector field 

In this subsection we wiii estabiish the main reiationship between the majorant function and the 
vector fieid necessaries to prove Theorem [2l 

Proposition 4. Let ^ G G 2 iPo,R)- < t < t then VX(^(s)) is nonsingular and the 

following inequality holds 

Proof. Using Definition [5] and Lemma O the proof foiiows the same pattern of Proposition 3.4 of 
m, see aiso Lemma 4.2. of m- □ 

Newton iteration at a point happens to be a zero of the iinearization at such a point. Therefore, 
we study the iinearization error of the vector heid and the associated majorant function. The 
formai definitions of these erros are: 

Definition 6. Let / : [0, R) be a continuously differentiable function. The linearization error 
of f is defined by 


e{a,b) := fib) - [f{t) + f{a){b - a)] , 'da,b£[0,R). (28) 

Definition 7. Let Ad he a Riemannian manifold, Ll T Ad an open set, X : D —>• TA4 a continuously 
differentiable vector field and a,b £ The linearization error of X on a geodesic C : [a,&] ^ D 

is defined by 


E (C(a), Cib)) := XiCib)) - P^,a,b [X(C(a)) + (6 - a)VX(C(a))C'(«)] • (29) 

In the next result we compare linearization error of the vector field with the linearization error 
of the majorant function associated. 

Lemma 4. Let ^ G G 2 iPo, R) be a curve passing through p = ^(u) and q = f,ib) .such that ^ is a 

geodesic and 0 < a < b. Take 0 < t < x < R. If i[f,, 0, a] <t and a,b\ < x — t, then 

\\XX{po)-^P^,b,oEip,q)\\ < eit,x)^-^^. 

As a consequence, the following inequality holds: \\XXipQ)~^P^^i„oEip,q)\\ < e{t,x). 

Proof. Definition [7] with ^ properties of parallel transport in ([2]) imply 

Eip,q) = Pi,a,b [Pi,b,aXiq) - Xip) - (b - a)VX(p)e'(a)] . 

Hence, using Lemma[I]and that ^'(s) = P^^a,sf,'i(^), the last equality becomes 

Eip, q) = P^,a,b f [P^,s,aS/Xiiis))P^^a,s " VX(p)] ^'(ajds, 

J a 

which is equivalent to 

fb 

VXipo)-^P^,b,oEip,q) = / VXipo)-^ [Pi,s,oVXias))Pi,a,s - Pi,a,oXXip)]eia)ds. 

J a 
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Since ^ ; [a, 6] —)• is a geodesic joining p and q we have ||^'(a)|| = i[^,a,b]/{b — a). Thus last 

equality implies 


\\\/X{po)-^P^,b,oE{p,q)\\ < 

I' ||VX(po)-' [P^,.,oVX(e(s))P^,a,. - P^,,,oVX(p)]|| %^ds. (30) 

Because a < s < b, using the assumptions 0, a] < t and i[^, a, b] < x — t we have 

0, s] < £[^, 0, a] + a,b] < X < R, 


and as ^ : [0, s] —)• Ad is a piecewise geodesic curves joining the points po to ^(s) through p, i. e., 
^ £ G 2 {po,R), we may use the majorant condition in Definition [5] with b = s and q = ^{s) together 
with inequality in (|3n]) to conclude that 

\\VX{po)-^P^,b,oEip,q)\\ < f [fii[C,0,s])-fii[C,0,a])] ^|^ds. 

J a 0 CL 

Using convexity of /', 0, a] < t, a,b] < x — t, x < R and Proposition [T] we have 


f 0, s]) - /' (£[^, 0, a]) = /' {e[C, 0, a] + i[^, a, s]) - /' {e[C, 0, a]) 

< /'(t+ ^[^,a,s]) - /' (t) 




< 


b — a 




b — a 


^[C,a,b] 

X — t 


Therefore, combining two last inequality we obtain that 


j-b 

\VX{po)-^P^,b,oE{p,q)\\ < / 

J a 




— a 




{x — t){b — a) 


ds. 


After performing the integral and some algebraic manipulations the above inequality becomes 
||VA(po)"^Pc,fe,oP(p,9)|| < [fix) - fit) - f'{t){x - t)] 
which, Dehnition [6l implies the desired inequality. 


□ 


Proposition m guarantees, in particular, that VX{p) is nonsingular at p G B{pQ,t^) and, conse¬ 
quently, the Newton’s iteration is well defined in B{pQ,t*)- Let us call it Nx ■ B{po,t*) 

Nxip) ■= expp{-VX{p)-^X{p)). (31) 

One can apply a single Newton’s iteration on any p G B{pQ,t^) to obtain the point Nxip) which 
may not is contained to B{pQ,tif), or even may not in the domain of X. Hence, this is enough to 
guarantee the well-dehnedness of only one iteration. To ensure that Newtonian iteration may be 
repeated indefinitely, we need some additional dehnitions and results. First, we define some subsets 
of P(poT*) ill which, as we shall prove, Newton iteration ()3ip is “well behaved”: 

K{t):=l^pen : dipo,p)<t, ||VA(p)"^A(p)|| ^, 

K:= [J K{t), 
te[o,u) 
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t£[0,U). (32) 

(33) 















In ([32l) . 0 ^ t < i* < i, hence using Proposition [2] and Proposition U] we conclude that f'{t) / 0 
and VX{p) is nonsingular in B\pQ^t\ C respectively. Therefore the above definitions are 

consistent. It is worth point out that the above sets appeared for the first time in IS]; see also m- 


Lemma 5. For each t G [0,f*) and each p G K{t) there hold: 

i) \\VX{p)-^X{p)\\<-M; 

ii) d{pQ,p) + ||VX(p)“^X(p)|| < nf{t) < t*. As a consequence, d{po, Nx{p)) < nf{t) < t*. 

iii) \\VX{Nx{p))-^X{Nx{p))\\<-j^^^^ 


XX{p)-^X{p)\ 

-m/nt) 


Proof. Let t G [0,f*), p G K{t). Using definition of the set K{t) in (1321) the item i follows. 

Using Proposition [3] and definition of K{t) in (1321) to obtain that d{pQ,p) < t and nf{t) < 
t*, respectively. Hence, the proof of the first part of item ii follows by combination of two last 
inequalities with item i and definition of nj in ()22p . For proving the second part of item ii use 
triangular inequality to obtain d{po, Nx{p)) < d{po,p) + d{p, Nx{p)), definition in (f3T]l and then 
first part. 

We are going to prove item iii. Let ^ : [0, 2] —>• a piecewise geodesic curve obtained by 
concatenation of a minimizing geodesic ^ joining po and p and the geodesic curve defined 
by 


= expp{{l-t)XX{p) ^X{p)). 


(34) 


Note that ^ G G 2 {po^R)- From definition of the piecewise geodesic curve ^ and definitions in (|3T]l 
and (p^ we have 

£[^,0,2]=d{po,p) + \\XX{p)-^X{p)\\. 

Since ^(2) = Nx{p), using last equality, first inequality in item ii and Proposition (H by taking into 
account that the derivative f is increasing and negative in [0, f), we conclude that XX{Nx{p)) is 
nonsingular and there holds 


\\VX{Nx{p)) 'P5,o,2VX(po)|| < + ||VX(p)-iX(p)||)| - irinjim ' 

On the other hand, as 1,2] = ||VX(p)“^X(p)||, combining item i with definition of Uf in (|22p 
we obtain ^[^,1,2] < n/(f) — t. Since second part in item ii imples d {po, Nx{p)) < nf{t) < t*. 
Thus, we may apply Lemma 0] with x = nf{t) and q = Nx{p) to conclude that 


\\XX{po)-^P^,2,oEip,Nx{p))\\ < ( 36 ) 

We know that Nx{p) belongs to the domain of X. Hence, Newton’s iterations in (|31jl . linearization 
error in Definition [7] with C = ^|[i 2 ] ^^d (13^) yield 


E{p,Nx{p)) = X{Nx{p)) - Pi, 1,2 [X{p) + XX{p) {-XX{p)-^X{p))] , 


which is equivalent to E{p, Nx{p)) = X{Nx{p)). Thus, using this equality we obtain after simples 
algebraic manipulation that 


VX{Nx{p))-^X{Nx{p)) = VXiNx{p))-^Pi,o,2XX{po)VX{po)-^Pi,2,oE{p, Nx{p)). 
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Taking norm is last equality and using the inequalities (l35|) and (l36l) we easily conclude that 


VX{Nx{p))-^X{Nxip))\\ < 


e{t,nfit)) ||VX(p)-iX(p)| 
|/'(n;(t))| 


Finally, since nf{t) belongs to the domain of /, using the definitions of Newton iterations on (I22p 
and definition of the linearization error in (j28p . we obtain f{nf{t)) = e(t,nf{t)) which combined 
with nf{t) — t = f{t)/f'{t) and last inequality implies the desired result. Therefore, the proof of 
the lemma is concluded. □ 


Lemma 6. For each t G [0,t*) the following inclusions hold: K{t) C B(pQ,t^:) and 

Nx{K{t))cK{nf{t)). 

As a consequence, K C B{pQ,t^f) and Nx{K) C K. 

Proof. The first inclusion follows trivially from the definition of K(t) in (1321) . Combining items i 
and iii of Lemma [5] we have 

||VX(iV_.(p))-X(iV_.(p))||<2»^ 

Therefore, the second inclusion of the lemma follows from combination of last inequality in item ii of 
Lemma O last inequality and definition of K{t). The first inclusion on the second sentence follows 
trivially from definitions (j32p and (1331) . To verify the last inclusion, take p £ K. Then p € K{t) for 
some t G [0, t*). Using the first part of the lemma, we conclude that Nx{p) F K(nf{t)). To end 
the proof, note that nf{t) G [0, U) and use the dehnition of K in (l33p . □ 

We end this session limiting the derivative of the vector field by the derivative of the majorant 
function. 

Proposition 5. Ifd{po,p) <t < R then ||VX(p)|| < \\VX{pq)\\{ 2 + f'{t)). 

Proof. Let ^ : [0,1] —)• is a minimizing geodesic joining po to p. After some algebraic manipula¬ 

tions we have 


||VA(po)-^Pg,i,oVX(p)|| = ||VX(po)-i [P 5 ,i,oVA(p)P 5 ,o,i - VA(po) + VX(po)]|| 

< i|VA(po)-i [P^,i,oVA(p)P^,o,i - VX(po)]|| + ||/pj| . 

Since ^ is a minimizing geodesic joining po to p we have 0,1] = d(po,p). Thus, using that / is 
a majorant function at a point po for the vector field X, above inequality yelds 

||VA(po)-'Pg,i,oVA(p)|| < /'(d(po,p)) - /'(O) + 1 < 2 + fit), 

because d(po,p) < t and f' is a increasing function. Finally, using last inequality and taking into 
account that 

||VX(p)|| < ||VA(po)|| ||VA(po)-iP 5 ,i,oVA(p)|| , 

the desired inequality follows. □ 
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3.3 Convergence 

In this section we establish all the convergence results stated in Theorem [2] related to {pk}, the 
sequence generated by Newton’s Method, namely, the convergence of {pk} to a zero of X, the 
bounds in (fT^ . (fT6|) . (fT71) and (jlSp . For establish these results we will combine conveniently the 
results of the previous section. We begin with the following result: 

Proposition 6. Let {zk} be a sequenee in A4 and C > 0. If {zk} converges to z* and satisfies 

d{zk,Zk+i) <Cd{zk-i,Zk)‘^, /c = l,2,.... (37) 

then {zk} converges Q-quadratically to z^ as follows 

d{zk+i,z^) 

dizk zv 

k—¥oo 

Proof. The proof follows the same pattern as the proof of Proposition 1.2 of m- □ 

Using equality in (fT4|l and (|3n) . the sequence {pk} generated by Newton’s Method satisfies 

Pk+i = Nx{pk), k = 0,l,.... (38) 


This equivalent definition of the Newton’s sequence {pk} allow us to use the results of the previous 
section to establishes its properties of convergence. 

Corollary 3. The sequence {pk} is well defined, is contained in B{pQ,tfi) and satisfies the in¬ 
equalities in (II5|). Moreover, {pk] converges to a point p^, G satisfying X{p^,) = 0 and 

its convergence rate is R-linear as in (I16p . If, additionally, f satisfies h4 then the inequality (ffH) 
holds and, consequently, {pk} converges Q-quadratically to p^, as in (fTSl) . 

Proof. We are going to prove that the sequence {pk} is well defined. First note that, combining 
m, (H and hi we have 

po eK{0)c K, (39) 

where the second inclusion follows trivially from (|33p . Using the above inclusion, the inclusion 
Nx{K) <Z K In Lemma[6]and (j38h we conclude that {pk} is well defined and rests in K. From the 
hrst inclusion on second part of the Lemma [6] we have trivially that {pk} is contained in B{pQ,tfij. 
Now we are going to prove the inequalities in (I15p . First we will prove, by induction that 

Pk G K{tk), /c = 0,1,... . (40) 

The above inclusion for k = 0 follows from ()39p . Assume now that pk G K[tk). Thus, using 
Lemma El (IMl) and (l25p . we obtain that Pk+i G K{tk+i), which completes the induction proof 
of ()i0l) . Using dehnition of {tk} in (fTTp , we have — f (tk) / f (tk) = tk+i — tk- Hence combining 
definition of {pk} in (I14h with (I40p and item i of Lemma El we obtain 

d(pk,Pk+i) = \\^X{pk)~^X{pk)\\<tk+i-tk, A: = 0,1,.... (41) 


which is hrst inequality in (|15p . In order to prove the second inequality in (1151) . hrst note that 
Pk-i G K{tk-i), Pk = Nx{pk-i) and 4 = n/(4_i), for all A: = 0,1, ... . Thus, apply item iii of 
Lemma El with p = pk-i and t = 4-i to obtain 


d{pk,Pk+i) < 


/(4) \ d{pk-i,Pk) Y‘ 
f'{tk) [ tk - 4-1 J 
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which using second inequality in (1141) yields the desired inequality. 

To prove that {pk} converges to p* G B\pQ,t^] with X{p^) = 0 and (fT5]) holds, first note that as 
{tfc} converges to t*, the first inequality (fT^ implies 

OO OO 

d{pk+i,Pk) ^ ^ 4+1 -4 = 4- 4o < +00, (42) 

k=ko k=ko 

for any ko G N. Hence, {pk} is a Cauchy sequence in B{pQ,t^,) and ,thus, converges to some 
p* G 4[po,4]- Therefore, first inequality (fTHl) also implies that < t* — tfc for any k. Hence, 

the inequality (fTSll holds and, as {tfc} converges Q-linearly to t*, {pk} converges ii-linearly to p*. 
For proving that X{p^) = 0, note that first inequality in (fTHI) implies that d{po,pk) < tk — to = tk- 
Thus using Proposition [5] we have 

||VX(pfc)|| < ||VX(po)||(2 + /'(4)), fc = 0,l,... , 

which combining inclusion (j40p and second inequality in (|4ip yields 

\\X{pk)\\ < \\VX{pk)\\\\VX{pk)-^X{pk)\\ < ||VX(po)||(2 + /'(4))(4+i-4), A; = 0 , 1 ,... . 

Since X is continuous on H, {pk} C B{pQ,t^,) C H, {pk} converges to p* G H, the result follows by 
taking limit as k goes to infinite in above inequality. 

Now, we assume that h4 holds. Thus, combining second inequality in ()15p with ()26p . we obtain 
the inequality in (fTTp . To establish the inequality in (fTSp . use inequality in (fTTP and Proposition [6] 
with Zk = Pk and C = D“/'(4)/(—2/'(4))- Therefore, the proof is concluded. 

□ 


3.4 Uniqueness 

In this section we prove the last statement in Theorem [2l namely, the uniqueness of the singularity 
of the vector field in consideration. The results of this section generalize m Section 3.2 ] for a 
general majorant function, see also [31 Section 4.2 ]. 

Corollary 4. Take 0 < t and q G K{t). Define 

To = t, Tk+i = Tk - f{Tk)/f'{Tk), k = 0,l,.... 

The sequence {qk} generated by Newton’s method with starting point qo = q is well defined and 
satisfies qk G K{Tk), for all k. Furthermore, {rfc} converges to t*, {qk} converges to some q^ G 
B\po,t*] ® singular point of X and d{qk,q*) < 4 — Tk, for all k. 

Proof. The proof is a convenient combination of Lemma [6l Corollary [1] and Proposition [5l following 
the same pattern of Corollary 3.6 of dS]. □ 

The next two lemmas are most important results we need to prove the uniqueness of solution. 
The idea of its proofs are similar to the corresponding results of m, see also [3]. In this more 
general approach, some technical details related to the parallel transport and the majorant function 
(possibly non-quadractic) should be used. 

Lemma 7. Take 0 < t and p ^ K{t). Define for 0 G M 

c(0) = exppi-exx{p)-^x{p)), T{e) = t - 

Then for 9 G [0,1] we have t < t{9) < t^ and ({9) G K{t{9)). 
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Proof. The proof follows the same pattern of |16l Lemma 3.7], see also [3l Lemma 4.4]. □ 

Lemma 8. Take 0 < t and p G K{t). Suppose that G B\pQ,tf\ is a singular point of X 
and t + d{p,qt:) = t*. Then d{pQ,p) = t. Furthermore, t < nf{t) < t*, Nx{p) G K{nf{t)) and 
nf{t) + d{Nx{p), O'*) = t*. 

Proof. The proof follows the same pattern of [161 Lemma 3.8], see also [31 Lemma 4.5]. □ 

The proof of the next two results can be obtained by a simple adaptation of some arguments 
of [ini Corollary 3.9] and [I6l Lemma 3.10], see also [31 Lemma 4.5] and [31 Section 4.2.2], we also 
omit their proofs. 

Corollary 5. Suppose that g* G B\pQ,t^] is a singular point of X. If for some t,q 

0 < t < t*, g G K{i), 
and i + d{q, g*) = t*, then d{po, g*) = t*. 

Lemma 9. The sequence {pk} has limit p^, as the unique singular point of X in B\pQ,tf\. 

Lemma 10. Let q G B{pq,R) and ^ : [0,1] —)• A4 a minimizing geodesic in Qi{pq,R) joinning po 
to q. Then the following inequality holds: 

-f{d{po,q)) < \\VXipo)-^Pi,i,oX{q)\\. 

As a consequence, p* is the unique singularity of X in B{po, f), where f := sup{t G [t*, R) : f{t) < 
0}. 

Proof. Applying second part of Lemma 0] with p = pQ, a = 0, b = 1, t = 0 and x = d{po, q) we have 

e{0,d{po,q)) > ||VX(po)"^T’^,i,o^(Po,g)|| • 

From Definition [7l last inequality becomes 

e{0,d{po,q)) > ||VX(po)-'7’s,i,o^((?) - VX(po)-'^(po) -^'(0)11 . 

Using triangular inequality in the right hand side of last inequality, it is easy to see that 

eiO,d{po,q)) > ||e'(0)|| - ||VX(po)-'^(po)|| - ||VA(po)-'7^s,i,o^(g)|| • 

Combining Definition [6] with assumption (fTSl) and taking into account that llC^(0)jj = d{po,q) and 
/'(O) = —1, we obtain from last inequality that 

f{d{pQ,q)) - [f{0) + f'{0)d{po,q)] > d{po,q) - f{0) - ||VX(po)"^7’5,i,o^(g)|| , 

with is equivalent to the inequality of the lemma. Hence the first of the lemma is proved. 

For the second part, first note that in the interval the sign of / is negative. Hence, first 

part of the lemma implies that there is no singularity of X in B(pQ,f)\B\pQ,t^:]. Therefore, from 
Lemma[9l the unique singularity of X in B{pQ,f) is p* G H[poT*]- D 

3.5 Proof of Theorem [2] 

The proof of Theorem [2] follow by direct combination of Corollary O Corollary [3] with Lemma [101 
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4 On the proof of the main theorem 

In this section Theorem [3] wiii be used to prove a robust semi-iocai affine invariant theorem for 
Newton’s method for hnding a singuiarity of the vector fieid X, nameiy, Theorem [TJ The foiiowing 
resuit wiii be needed. 

Proposition 7. Let R > 0 and f : [0, i?) —)■ M a continuously differentiable function. Suppose 
that po G Q, f is a majorant function for X at po with respect to G 3 {po,R) and satisfies h4. If 
0 < p < r/2, where T := sup{—/(t) : t G [0,i?)}, then for any qo G B\pq, p] the derivative VX((7o) 
is nonsingular. Moreover, the scalar function g : [0, R — p) ^ M, 

9{t) = jj^^[f{t + p) + 2p], 

is a majorant function for X at go with respect to G 2 {qo,R — p) and also satisfies condition h4. 

Proof. Since the domain of / is [0,R) and f'{p) < 0 (see Proposition [2] item iv ), we conciude 
that g is weii defined. First we wiii prove that function g satishes conditions hi, h2, h3 and h4. 
Definition of g and f'{p) < 0 triviaiiy impiy g'ff) = —1. Since / is convex and /^(O) = —1 we have 
f{f) + t> /(O) > 0, for aii 0 < t < 12, which, by using Proposition [2] item iv and that 0 < p, yieids 
g{0) = [f{p) + 2p\/\f'{p)\ > 0, hence g satisfies hi. Using that / satisfies h2, we easiiy conciude 
that g aiso satisfies h2. Now, as p < r/2, using Proposition [2] item Hi, we have 

iim g{t) = J. {2p - P) < 0 , 
t^t-p \f’{p)\ 

which impiies that g satisfies h4 and, as g is continuous and ^(0) > 0, it aiso satisfies h3. 

To compiete the proof, it remains to prove that g satisfies Q. First of aii, for any G B\pQ,p\, 
from Proposition [2] item iv, we have d{qQ,po) < p < t. Let t] : [0,1] -G- Ai be the minimizing 
geodesic joining pq to go- Since rj G Gi{po,R) C Q 2 {Po-,R) and d{pQ,qo) = i[T],0, 1] < p < t we can 
apply Proposition m to obtain that VX(go) is nonsingular and 

||VX(go)-'P,,o,iVX(po)|| < ^ 7 ^. (43) 

Because B{pq,R) C D, for any go G B[pQ,p], we trivially have B{qo, R — p) C D. Let /i : [0, T] —)• A1 
such that p G G 2 {qo, R — p) and co, ci, C 2 G [0, T] with cq = 0 < ci < C 2 = T such that p\ is a 

minimizing geodesic and is a geodesic. Take a,b G [0,T] with 0 < a < b. Thus 

p{a), p{b) G B{qo,R- p), e[p,0,a] + e[p,a,b] < R - p, d{qo, p{a)) = e[p,0,a]. 

Using definitions of the curves p and p, properties of the parallel transport, property of the norm 
and simple manipulation, we conclude that 

\\XX{qo)-^[Pf,,b,o^X{p{b))P^^a,b- Pp,afl^X{p{a))]\\ < 

||VX(go)-ip^,o,iVX(po)|| \\XX{po)-^Pp,po [Pp,b,o^X{p{b))P^,a,b - Pp,a,o^X{p{a))]\\ . (44) 

Now we are going to estimate the second norm of the right hand side of above inequality. First, we 
define ^ : [ 0 , T] —)• A 1 a piecewise geodesic curve in G 3 {po,R) as concatenation between the curves 
p and p, i.e., take cq = 0 < ci < C 2 < C 3 = T such that 

^l[co, cil “ ^l[0, 1]’ C 2 ] ~ ^l[0, cil’ ^I[C 2 , eg] “ eg] ' 
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Definition of ^ in (I45p and definition of curve /i imply that there exist a,b € dom(,^) with 0 < 
a < b such that ^(a) = //(a) and ^( 6 ) = /i( 6 ). Therefore, properties of parallel transport yield 
PrJ,l,oP^l,b,o = P^,b,o- Hence, 


VX(po)"'H^,i,o[H^AoVX(/i(6))P^,„,b- P^,„,oVX(/r(a))] || = 


VX(po)-' P^,a,oVX(C(a)) 


Since ^ G Q^{po,R) and / is a majorant function for X at po with respect to Q^{po,R), applying 
Dehnition [5] with a = a and b = b, last equality becomes 


VX(po) [Pti,b,o'^X{p{b))Pfj_^a,b 


P,,afl^X{p{a))]\\ 


rmO,a]). (46) 


Combining last inequality with (l43]l . (f44]l and (j46]l we obtain 


XX{qo)-^[Pf,,b,o^X{p{b))P^^a,b 


P^,,,oVX(^(a))]|| < 

1 


r ( 4 ^, 0 ,5]) 


f (^[C, 0 ,a]) 


(47) 


Since f is convex, the function s i—)• f'{t + s) — f'{s) is increasing for t > 0. Hence taking into 
account that definitions of ^ in () 1 ^ and p imply 0 , a] = 0 , ci] +^[^, ci, a] < p + i[p-, 0 , a] and 

0 , 6 ] = £[^, 0, a] + a,b] < p + i[p, 0, a] + i[p, a, b], we conclude that 

f 0 , b]^ - f (^[^, 0 , a]) < f{p + i[p, 0 , a] + £[p, a, 6 ]) - f\p + e[p, 0 , a]). 


Since i[p,0,b] = £[p,0,a] + i[p,a,b], combining inequality in (HT)) and last inequality with the 
dehnition of the function g we have 


||VX(go)"^ [P^,,b,oVX{p{b))P^,a,b - Pf,,a,o^X{pia))]\\ < g' ii[p,0,b]) - g' {i[p,0,a]), 
implying that the function g satishes Q, which complete the proof of the proposition. □ 


Proposition 8 . Let q £ B{po, R) and ^ : [0,1] —A4 a minimizing geodesic joinning pq to q. Then 
the following inequality holds: 

||VX(po)-'n,i,o^((7)ll < f{d{po,q))+2d{po,q). (48) 

Proof. Applying second part of Lemma 0] with p = pQ, a = 0, b = 1, t = 0 and x = d{po, q) we have 

e{0,d{po,q)) > \\VX{poy^P^ppE{po,q)\\ . 

From Dehnition [7] last inequality becomes 

e{0,dipo,q)) > \\VX{po)-^P^,i,oX{q)-VX{po)-^Xipo)-^'iO)\\ . 

Using triangular inequality in the right hand side of last inequality, it is easy to see that 

eiO,d{po,q)) > ||VX(po)-'P?,i,o^(<?)|| - \\VX{po)-^Xipo)\\ - ||e'( 0 )|| . 

Combining Dehnition [ 6 ] with assumption (fTSll and taking into account that ||^'(0)|| = d{po,q) and 
/'(O) = —1, we obtain from last inequality that 

f{d{po,q))- [f{0)+f{0)d{po,q)] > \\VX{po)~^P^ppX{q)\\ - f{0) - d{po,q), 

which is equivalent to the inequality of the lemma. Hence the lemma is proved. □ 
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4.1 Proof of Theorem [T] 

Proposition [3 claims that for any qq G B\pq, p] the derivative VX(go) is nonsingular. Moreover, 
the scalar function g : [0, R — p) ^ W, 

sit) = +p) + 2p], (49) 

is a majorant function for X at qo with respect to G 2 iqo,R — p) and also satisfies condition h4. 
Let ^ : [0,1] —)• A4 a minimizing geodesic joining pQ to g'o- Since item iv of Proposition [2] implies 
^[^,0,1] = d{po,qo) < p < t, thus Proposition 0] give us 

||VX(go)-'Pc,o,iVX(po)|| < 

Combining property of norm with last inequality and Proposition [8] with q = qo, we have 
||VX(go)-'^((?o)|| < ||VX(go)-'P€AiV^(Po)||||VX(po)-'Pc,i,o^(®)|| 

< |y 7 ^[/(d(go,Po)) + 2d(^o,Po)]- 

As /' > —1, the function t i—)• /(t) + 2t is (strictly) increasing. Using this fact, above inequality, 
d{po,qo) < p and (l4^ we conclude that 

||VX((?o)-'A(go)|| <5(0). 

Therefore, last inequality allow us to apply Theorem [2] for X and the majorant function g at point 
<70 for obtaining the desired result. 

5 Special cases 

Kantorovich’s theorem under a majorant condition in Riemannian settings was used in [3], see 
also [25] to prove Kantorovich’s theorem under Lipschitz condition in Riemannian manifolds m. 
Smale’s theorem [37] and Nesterov-Nemirovskii’s theorem m- Using the ideas of [3] we present, 
as an application of Theorem [T] a robust version of these theorems. 

5.1 Under Lipschitz’s condition 

Theorem 3. Let Ai he a Riemannian manifold, U C A4 an open set and LI its closure, A : U —)• 
TAi a continuous vector field and continuously differentiable on Q. Take po G Lt, L > 0, fi > 0 
and R = sup{r > 0 : B{po,r) C U}. Suppose that VA(po) is nonsingular, Bipo, 1/L) C U, 

||VA(po)-M^CAoVA(C(6))P^,a,6- Pc,a,oVA(C(a))]|| <LfK,a,6], 

for all L in GziPOiR) o-'nd 2/3L < 1. Moreover, assume that 

||VA(po)-'X(po)|| < /3. 

Let 0 < p < (1 — 2/3L)/(4L) and t^^^p = (^1 — pL — y^l — 2L(/3 + 2p)'j /L. Then the sequence 
generated by Newton’s Method for solving the equations X{p) = 0, with starting point qo, for any 
qo G B\po,p], 

qk+i = expq^ i-XX{qk)~^X{qk)) , k = 0,l,... . 


18 






is well defined, {qk} is contained in o,nd satisfy the inequality 


d{qk,qk+i) < 


L 

2^1 - 2L(/3 + 2p) 


d{qk-i,qkf, 


k = l,2,... 


Moreover, {< 7 ^} converges to p^, G such thatX{pfij = 0 and the convergence is Q-quadratic 

as follows 

d{qk+i,p*) L 

lim sup —-^ < - , . 

k^oo d(^Qk-) 2'y/I — 2Z^(/3 “h 2/)) 

Furthermore, if B{pq,t) C ri then p^^ is the unique singularity of X in B{pq,t), where r := 
(1 + VI - WL) /L. 

Proof. The proof follows from Theorem [T] with the quadratic polynomial f{t) = — t + fi as the 

majorant function to X with respect to Gsipo, 1/T) and T = (1 — 2/3L)/(4L). □ 


5.2 Under Smale’s condition 

Theorem 4. Let A4 be an analytic Riemannian manifold, Li C Ai an open set and X : —)• 
TM. an analytic vector field. Let po G Ai be such that VX(po) is nonsingular and set fi := 
||VX(po)"^-’^(po)||- Suppose 


a 


:= /37 < 3 - 2V2, 


7 := sup 

n>l 


1 


n! 


VX(po)-'V"X(po) 




< 00 , 


B{pq, R) C Ll, where R\= {1 — l/V2)/7. Let 0 < /? < [3 — 2y/2 — a]/( 27 ) and 

t*,p '■= + 1 - 2p7 - ^(a + 1 - 2 pqfi - 8 a - 8 pq{l - a)^ /(47). 

Then the sequences generated by Newton’s method for solving the equations X{p) = 0 with starting 
at qo, for any qo G B\po,p], 


qk+i = expg,^{-XX{qk) ^X{qk)), 


k = 0 , 1 ,-. 


are well defined, {qk} is contained in i?[goV*,p] satisfy the inequality 

7 „ ^2 


d{qk,qk+i) < 


(1 - l{U,p + P))[ 2 (l - l{U,p + p)Y - 1 ] 


d{qk-i,qky 


k = 1 , 2 ,... 


Moreover, {qk} converges to p<f G i?[poV*,o] such that X{pfi) = 0 and the convergence is Q-quadratic 
as follows 

d{qk+i,p*) 7 

iim sup —-- 7 ^ < --^----;-t-t; -r. 

fc^oo d{qk,p*) (1-7(t*,p + p))[2(l-7(t*,p + p ))2 - 1] 

Furthermore, p* is the unique singularity of X in B {pq,R) C fl. 

We need the following results to prove the above theorem. 

Lemma 11. Let Ai be an analytic Riemannian manifold, Ll F Ai an open set and X : LI ^ TAi 
an analytic vector field. Suppose that pQ G fl, VX(po) is nonsingular and that R< {1 — 1/V2)7“^. 
Then, for all Q G Gfipo,R) there holds 


\\VX{po)-^Pc,s,o^^XiCis)))\\ < {2q)/il-qi[C,0,s]y 
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Proof. The proof follows the same pattern of Lemma 5.3 of [ 3 ]. 


□ 


Lemma 12. Let Ai be an analytic Riemannian manifolds, Ll Q Ai an open set and X : Ll ^ TAi 
an analytic vector field. Suppose that po £ Ll and VX{pq) is nonsingular. If there exists an 
/ : [0, i?) —>• R twice continuously differentiable such that 

||VX(po)-'Tc,^,oV2^(C(s)))|| ^ f"{i[C,0,s]), (50) 

for all C € G 3 {po,R) and for all s G dom{C), then X and f satisfy (l5|) with n = 3. 

Proof. Let C be a curve of Q 3 {pq,R), a,b £ dom((() with 0 < a < 6. Prom Definition 0] there exist 
co,ci,C2,C3 £ [0,T] with cq = 0 < ci < C2 < C3 = T such that and are minimizing 

geodesics and is a geodesic. We have six possibilities: 

• a, b £ [a, Cj+i] for i = 0,1, 2; 

• a£ [ci, Ci+i] and b £ [q+i, Ci+2] for i = 0,1; 

• a £ [co, Cl] and b £ [c2, C3]. 

We are going to analyze the possibility a £ [cq, ci] and b £ [c2, C3], the others are similar. Since 
a £ [co, Cl] and is geodesic, taking v £ T^(a)Al and Y £ X{Ai) the vector field on C, such 

that V^/(s)y = 0 and y(C(a)) = v, we may apply Lemma[2]to have 

^C,ci,aVX(C(ci)) F(C(ci)) = VX(C(a))Y(C(a)) + H P^,s,a^^X{Cis)) (y(C(s)), C'(5)) ds. (51) 

J a 

Using that Y{C{a)) = v and y((C(ci)) = we obtain, after some algebraic manipulation in 

last equality, that 


VX(po)-' [Pc,ci,oVX(C(ci))Pc,„,,i - Pc,„,oVX(C(a))] u = 

r\x{po)-^Pc,s,o^^X{C{s)) {Y{as)),C'{s)) ds. 
J a 

Since llT(C(s))jj = jjujj for all s £ [a, ci] and c is a arbitrary, we conclude from Dehnition [3] that 


||VX(po)-' [Pc,ci,oVX(C(ci))Pc,a,ci - Pc,a,oVX(C(a))]|| < 

rilVX(po)-'Tc,«,oV2x(C(s)))llllC'(5)||d5. 

J a 

Now, as llC^(s)ll = .^[C,a,ci]/(ci — a) and £[C,0, s] = .^[C,0,a] + ((ci — s)/(ci — a))i[C,a,ci] < R for 
all s £ [o,Cl], using (f50]l we obtain, from the last inequality, that 


VX(po)-M^’c.ci,oVX(C(ci))Pc,„,ei 


Pc,a,oVX(C(a))]|| < 

J f” 0, a] + _ ^ ^[C, a, Ci]^ 


Cl — a 


Evaluating the latter integral, it follows that 


VX(po)-' XX(f{c3))Pi^a,c, - P5,a,oVX(C(a))] II < f {I[i, 0, Cl]) - f (£[C, 0, a]) . (52) 
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On the other hand, using that is geodesic, similar arguments used above show that 

II VX(po)-' [Pi,c,,o VX{ac2))Pi,c,,c, - P«,ci,oVX(e(ci))] II < f 0, C2]) - /' 0, Cl]). (53) 

We may also use that 6 s [c2, C3] and is geodesic to obtain the following inequality 

||VX(po)-' [Pii,b,oVX{m)Pi,c 2 ,b- Pa,c,,o^X{ac 2 ))]\\ < f m,0,b]) - f {i[^,0,C2]). (54) 

Now, taking into account that the parallel transport is an isometry, the triangular inequality yields 

||VW(po)"^ [Pa,b,o^X{m)P^,a,b - P5,a,oVW(e(a))]|| < 

||VX(po)-' [Pa,b,o^X{m)PLc2,b - P€,c„oVX(e(c2))]|| + 

||vw(po)-' [Pa,c 2 fl'^x{ac 2 ))Pa,cuc 2 - Ps.ci,oVW(C(ci))]|| + 

||vx(po)-' [Pa,cuo^x{aci))Pa,a,c, - P^,a,oVX(e(a))]||. 

Combining last inequality with (f5^ . (l53|) and it follows that 

II VX(po)-' [Pa,b,o VX{m)Pa,a,b - Pc,a,oVW(C(a))] II < /' {i[i, 0,6]) - /' {i[i, 0, a]), 

which is the desired resnlt. □ 

Proof of Theorem 4. Since a < 3 — 2\/2, combining Lemma [TT] and Lemma [T2] we have that 
the analytic function / : [0, i?) —)• M dehned by f{t) = /? — 2t + t/(l — yt) is a majorant function to 
X with respect to Qz{Po^ P)- Hence, the proof follows from Theorem [T] with L = (3 —2\/2 —o)/7. □ 


5.3 Under Nesterov-Nemiroviskii’s condition 

Theorem 5. Let C C M"' he a open convex set and F : C be a strictly convex function, three 
times continuously differentiable. Take xq G C with F"{xo) nonsingular. Define the norm 

ll’^^jjxo •— \/ {u, u )xq ) V u S M , 

where {u,v)xq = a~^{F"{xo)u,v) for all u,v G ffi"" and some a > 0. Suppose that F is a-self- 
concordant, i.e., satisfies 

\F'"{x)[h,h,h]\ < 2a-^/'^{F''{x)[h,h]f/^, V x G C, h G M^, 

Wi{xo) = {x G M”' : jjx — xolUo < 1} C C and there exists f3 > 0 such that 

||F"(xo)-^U(xo)|Uo </3 <3-2^2. 

Let 0 < p < (3 —2\/2 —/3)/2 and t^,,p := (^ + \ — 2p — y^(a + 1 — 2pY — So — 8/9(1 — a)^ /4. Then 
the sequences generated by Newton’s method for solving the equations F'{x) = 0 with starting at yo, 
for any yo G VLp[xo] = {x G M"' : jjx - xqIUo < p}, 

yk+i = yk- F''{yk)~^F'{yk), A: = 0 ,1,... 

is well defined, {yk} is contained in Wt*,p[xo] = {x G M"" : jjx — xojjxo < A*,p} and satisfy the 
inequality 

lll/fc+i ~ J/fcll < wj T, ~ vTfoTi 7+ T vi 2 ~ J/fc-ill ) k = 1,2,.... 

(1 - {U,p + p))[2{l - [U,p + p)Y - IJ 
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Moreover, {yk} converges to x* G Wt^ o[*o] such that F'{x^,) = 0 and the convergence is Q-quadratic 
as follows 

]• 11^* yfc+ill ^ ^ 

iimSUp-7-^ ^-;- ,, , , -;--77^- 7- 

fc^oo \\x*-yk\\^ (1 - (t*,p + p))[2(l - (t*,p + /9))2 - 1] 

Proof. Since a < 3 — 2y/2, combining Lemma 5.1 of [3] and Lemma [l2] we have that the function 
/ : [0, i?) — )• M defined by f{t) = (3 — 2t + t/{l — t) is a majorant function to F' with respect to 
03 (xq, R). Hence, the proof follows from Theorem [T] with L = 3 — 2\/2 — b. □ 

6 Final remark 

Let us present some computational aspects of Newton’s method in Riemmanin settings for solving 
the equation (H]). Note that the first equality in Q is equivalent to 

Qk+i = expq^Sk, VX{qk)~^Sk =-X{qk), A; = 0,1,.... (55) 

Since the solution of the linear systems in (|55p for large systems is computationally expensive, 
namely, at each iteration the derivative at qk must be computed and stored. Besides, the solution 
of the linear system in ()55p is required. To circumvent these drawbacks, we propose the inexact 
Newton’s method: given an initial point qo, the method generates a sequence {qk} as follows: 

qk+i = expq^Sk, XX{qk)~^Sk =-X{qk)+rk, \\rk\\ < 0k\\X{qk)\\ k = 0,l,.... 

for a suitable forcing sequence {6k}, which is used to control the level of accuracy. Therefore, solu¬ 
tions of practical problems are obtained by computational implementations of the inexact Newton¬ 
like methods. The analysis of these methods under majorant condition will be done in the near 
future. 
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